Sclability of Massively Parallel Depth-First Search

نویسنده

  • Alexander Reinefeld
چکیده

We analyze and compare the scalability of two generic schemes for heuristic depthrst search on highly parallel MIMD systems. The rst one employs a task attraction mechanism where the work packets are generated on demand by splitting the donor's stack. Analytical and empirical analyses show that this stack-splitting scheme works e ciently on parallel systems with a small communication diameter and a moderate number of processing elements. The second scheme, search-frontier splitting, also employs a task attraction mechanism, but uses pre-computed work packets taken from a search-frontier level of the tree. At the beginning, a search-frontier is generated and stored in the local memories. Then, the processors expand the subtrees of their frontier nodes, communicating only when they run out of work or a solution has been found. Empirical results obtained on a 32 32 = 1024 node MIMD system indicate that the search-frontier splitting scheme incurs fewer overheadsand scales better than stack-splitting on large message-passing systems. Best results were obtained with an iterative-deepening variant that improves the work-load balance from one iteration to the next.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Depth-Bounded Discrepancy Search

Search strategies such as Limited Discrepancy Search (LDS) and Depth-bounded Discrepancy Search (DDS) find solutions faster than a standard Depth-First Search (DFS) when provided with good valueselection heuristics. We propose a parallelization of DDS: Parallel Depthbounded Discrepancy Search (PDDS). This parallel search strategy has the property to visit the nodes of the search tree in the sam...

متن کامل

Work-Load Balancing in Highly Parallel Depth-First Search

Among the various approaches for parallel depthrst search (DFS), the stack-splitting schemes are most popular. However, as shown in this paper, dynamical stack-splitting is not suitable for massively parallel systems with several hundred processors. Initial work-load imbalances and work packets of dissimilar sizes cause a high communication overhead. We compare work-load balancing strategies of...

متن کامل

Scalability of Massively Parallel Depth-first Search

We analyze and compare the scalability of two generic schemes for heuristic depthrst search on highly parallel MIMD systems. The rst one employs a task attraction mechanism where the work packets are generated on demand by splitting the donor's stack. Analytical and empirical analyses show that this stack-splitting scheme works e ciently on parallel systems with a small communication diameter a...

متن کامل

Graph-Based Substructure Pattern Mining Using CUDA Dynamic Parallelism

CUDA is an advanced massively parallel computing platform that can provide high performance computing power at much more affordable cost. In this paper, we present a parallel graph-based substructure pattern mining algorithm using CUDA Dynamic Parallelism. The key contribution is a parallel solution to traversing the DFS (Depth First Search) code tree. Furthermore, we implement a parallel frequ...

متن کامل

Massively Parallel A* Search on a GPU

A* search is a fundamental topic in artificial intelligence. Recently, the general purpose computation on graphics processing units (GPGPU) has been widely used to accelerate numerous computational tasks. In this paper, we propose the first parallel variant of the A* search algorithm such that the search process of an agent can be accelerated by a single GPU processor in a massively parallel fa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994